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CROSS-REFERENCE TO CD-ROM APPENDIX 

[0001] A CD-ROM appendix includes software written in pseudo code for an embodiment of 
a de-skewing method. 

[0002] Volume in drive D is 011005_1635 

Volume Serial Number is C7C0-4943 
Directory of D:\ 

10/05/01 04:35p <DIR> 
10/05/01 04:35p <DIR> 

10/05/01 03:33p 40,448 PSUDOC-^LDOC 

3 File(s) 40,448 bytes 

Total Files Listed: 

3 File(s) 40,448 bytes 

0 bytes free 

[0003] A portion of the disclosure of this patent document contains material which is subject 
to copyright protection. The copyright owner has no objection to the facsimile reproduction by 
anyone of the patent document or patent disclosure, as it appears in the U.S. Patent and 
Trademark Office patent files or records, but otherwise reserves all copyrights whatsoever. 

FIELD OF THE INVENTION 



[0004] The invention relates to semiconductor processing and more specifically to a method 
to de-skew a wafer at any processing step using a single recipe. 
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BACKGROUND 

[0005] Repeated loading of a wafer onto a stage can introduce offsets in the coordinates of a 
location on the wafer where measurements are taken. De-skewing is a method to correct the 
offset. De-skewing is often performed prior to taking metrology measurements of the wafer at 
different steps of wafer processing. 

[0006] To de-skew a wafer, initial coordinates of two or more unique sites ("de-skew sites") 
on a wafer are determined after the wafer is loaded onto a mechanical stage at a first time and 
subsequent coordinates of the two de-skew sites are determined after the wafer (or a different 
wafer) is loaded onto the stage at a subsequent time. A transformation matrix is calculated from 
the initial and the subsequent coordinates of the de-skew sites. The transformation matrix is 
used to convert between the initial and subsequent coordinates of a measurement location. Thus 
measurements can be taken at the same locations on the wafer with convenience and accuracy. 

[0007] A conventional pattern recognition system can be used in the de-skew process. The 
conventional pattern recognition system uses a "recipe" that includes one image of each de-skew 
site of the wafer after one specific process (i.e., for a specific layer of the wafer). During certain 
wafer processes such as chemical mechanical planarization (CMP), pattern images of the de- 
skew sites may change so significantly that the pattern recognition system is unable to find the 
de-skew sites using one recipe. Thus conventionally the pattern recognition system uses a 
different recipe after each specific process (i.e., for each specific layer of the wafer). 

SUMMARY OF THE INVENTION 

[0008] In accordance with embodiments of the invention, a method for processing wafers 
includes learning a first pattern at a de-skew site on a first wafer layer, saving the first pattern in 
a recipe for de-skewing wafers, learning a second pattern at the de-skew site on a second wafer 
layer, and saving the second pattern in the same recipe for de-skewing wafers. Teaming the first 
pattern may include determining a score of uniqueness for the first pattern. Saving the first 
pattern may be performed when the score of uniqueness is greater than a threshold value. 

[0009] In one embodiment, the method fiirther includes finding the de-skew site on the 
second wafer layer using the first pattern before learning the second pattern. Finding the de- 
skew site may include determining a score of similarity between the first pattern and the second 
pattern. Learning the second pattern may be performed when the score of similarity is less than 
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a threshold value. 

[0010] In accordance with embodiments of the invention, a recipe for de-skewing wafers 
includes a plurality of patterns of a de-skew site on a surface of a wafer, wherein the plurality of 
patterns include a first pattern at the de-skew site of a first wafer layer and a second pattern at the 
de-skew site on a second wafer layer, 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0011] FIG. 1 illustrates a method for creating a single recipe for a wafer at any stage of 
wafer processing in accordance with embodiments of the invention. 

[0012] FIG. 2 illustrates a method for learning a pattern in the method in FIG, 1 in one 
embodiment. 

[0013] FIG. 3 illustrates a method for finding a de-skew site using a learned pattern in an 
action of the method in FIG. 1 in one embodiment. 

[0014] FIG. 4 illustrates a method for using the recipe created in the method of FIG. 1 in one 
embodiment. 

[0015] FIGS. 5A and 5B illustrate an exemplary calibration wafer with various de-skew 
sites. 

[0016] FIGS. 6A, 6B, 6C, 6D, 6E, and 6F illustrate exemplary patterns of a de-skew site at 
various stages of wafer processing. 

DETAILED DESCRIPTION 

[0017] FIG. 5A shows a wafer 500 with a measurement site 501 and de-skew sites 502-i and 
502-n (where l<i<n) placed on a mechanical stage 504 (shown schematically only in FIG. 5A). 
FIG. 5B shows wafer 500 placed on the stage at a later time with offsets to sites 501, 502-i, and 
502-n where the original positions of wafer 500 are shown with broken lines. The coordinates of 
sites 501, 502-i and 502-n have respectively changed fi-om (X0,Y0) to (X0',Y0'), from (X1,Y1) 
to (XI ',Y1 and fi-om (X2,Y2) to (X2',Y2')- FIGS. 6A-6F show patterns of, e.g., de-skew site 
502-i after various processes. 

[0018] FIG. 1 illustrates a calibration method 100 in embodiments of the invention for 
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creating a single recipe for de-skewing wafer 500 at any stage in the processing of wafer. In one 
embodiment, method 100 is performed for each of de-skew sites 502 -i and 502-n. 

[001 9] A pattern recognition system, or a user, or a combination of the pattern recognition 
system and the user can perform embodiments of method 100. The pattern recognition system 
can be standalone or part of a metrology measurement system used to characterize wafers before, 
during, and after wafer processing. The measurement system is, e.g,, NanoSpec 9100 available 
from Nanometrics, Inc., ofMilpitas, CaUfomia. 

[0020] In action 101 of method 100, all wafer layers of a calibration wafer (e.g., wafer 500 in 
FIG. 5 A) are put into a group B. Group B includes all wafer layers where de-skew sites 502-i 
and 502-n have not been learned or can be recognized (describe later in action 112). In one 
embodiment, the user puts the wafer layers in group B. 

[0021] in action 102, a pattern is selected at a de-skew site (e.g., de-skew site 502-i) on a 
wafer layer in group B. Hereafter the surface of a wafer after a specific process is referred to as 
a "wafer layer." For example, the surface of the wafer after a first process is called a first wafer 
layer, the surface of the wafer after a second process is called a second wafer layer, and so on. 

[0022] In one embodiment, the pattern of de-skew site 502-i on a particular wafer layer (e.g.^ 
wafer layer 6 on FIG. 6F) is selected because the pattern is the most similar to patterns on de- 
skew site 502-i on other wafer layers (e.g., wafer layers 1-5 in FIGS. 6A-6E). By selecting a 
pattern that is most similar with the other patterns, the number of patterns that must be later 
learned to recognize the same de-skew site on different wafer layers is reduced. 

[0023] In one embodiment, a unique pattern with good image contrast is selected. In one 
embodiment, de-skew sites 502-i and 502-n are selected to be as far away from each other in 
distance as possible. In one embodiment, the user visually selects a pattern for its uniqueness 
and similarity with the other patterns based on experience. 

[0024] In action 103, the selected pattern is learned. FIG. 2 shows one embodiment of action 
103. Alternatively, the selected pattern is learned by conventional pattern recognition 
techniques. 

[0025] In FIG. 2, action 202, the pattern is received. In one embodiment, the pattern 
recognition system receives the pattern from an input device such as a digital camera. 
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[0026] In action 204, a score is given to the pattern for its uniqueness (e.g., ease of 
recognition without error). In one embodiment, the pattern recognition system uses a software 
called "MTI Precision" available from Mathematical Technologies, Inc. of Providence, Rhode 
Island, to determine the uniqueness score (e.g., 0 to 100). MTI Precision receives the pattern in 
grayscale pixels and outputs a score. Alternatively, the score may be given according to a 
conventional pattern recognition method. 

[0027] In action 206, a determination is made as to whether or not the score of the pattern is 
higher than a uniqueness threshold value. If no, then action 206 is followed by action 208. If 
yes, then action 206 is followed by action 212. In one embodiment, the pattern recognition 
system makes the determination while the user sets the uniqueness threshold value. The 
threshold value is, e.g., 80. 

[0028] In action 208, a determination is made as to whether or not to edit the pattern. If yes, 
action 208 is followed by action 210 where the pattern is edited. If no, action 214 is followed by 
action 211 that indicates a failure to learn the pattern. If the pattern at de-skew site 502-i is not 
learned, then another de-skew site needs be to selected and the previously learned patterns at de- 
skew site 502-i are discarded (described later in action 105 of FIG. 1). In one embodiment, the 
user makes the determination whether or not to edit the pattern. 

[0029] In action 210, the pattern is edited. In one embodiment, the user selects unwanted 
features from the pattern and instructs the pattern recognition system to remove the unwanted 
features (e.g., by creating a mask to remove the imwanted features). The unwanted features may 
be uncommon between the patterns at the same de-skew site or reduce the uniqueness score of 
the pattern. The user may select to edit the pattern of the current de-skew site in action 210 to 
avoid selecting a new de-skew site (described later in action 105). Action 210 is followed by 
action 202, and the previously described actions cycle until a pattern with a uniqueness score 
above a threshold is created in action 210 or another de-skew site is selected in action 105 that 
follows action 211. 

[0030] In action 212, the selected pattern is shown. In one embodiment, the pattern 
recognition system shows the selected pattern to the user on a video display. 

[0031] In action 214, a determination is made as to whether or not the pattern includes 
unwanted features. If yes, action 214 is followed by the previously described action 210 where 
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the unwanted features are removed. If no, action 214 is followed by action 216. In one 
embodiment, the user makes the determination whether the pattern includes unwanted features 
based on experience, 

[0032] In action 216, the pattern is saved as a learned pattern. In one embodiment, the 
pattern recognition system uses MTI Precision to save the original pattern and the mask, if any, 
in a file, and a pointer (e.g., a file name) to the file in a single recipe in memory. Saving the 
pointer to the original pattern and the mask, if any, may be considered the equivalent of saving 
the original pattern and the mask in the recipe. Action 216 is followed by action 218 that 
indicates the pattern has been successfully learned. 

[0033] Retiuning to action 104 of FIG. 1, a determination is made as to whether or not the 
pattern has been successfully learned. If yes, then action 104 is followed by action 106. If no, 
then action 104 is followed by action 105. 

[0034] In action 105, a pattern at a new de-skew site (e.g., de-skew site 502-n) on the same 
wafer layer is selected. In one embodiment, the previously learned patterns for the old de-skew 
site (e.g., de-skew site 502-i) are discarded. In one embodiment, the user selects the new de- 
skew site according to the factors described in action 102 and the pattern recognition system 
erases the previously learned patterns at the old de-skew site from the recipe in memory. Action 
105 is followed by the previously described action 103 where the pattern is learned. 

[0035] In action 106, the location of the saved pattern is saved. In one embodiment, the 
pattern recognition system saves the current location of the center of de-skew site 502-i, (XI, 
Yl), in the recipe in memory. The location of de-skew site 502 ~i is only saved in the first 
iteration of method 100 or when a new de-skew site is selected in action 105. In one 
embodiment, the location is based on the coordinate system of stage 504. 

[0036] In action 108, the wafer layer is placed in a group A. Group A includes all wafer 
layers where de-skew site 502-i has been learned or can be recognized (describe later in action 
112). In one embodiment, the user puts the wafer layers in group A. 

[0037] In action 1 1 0, a wafer layer in group B is loaded and positioned so de-skew site 502-i 
can be examined. In one embodiment, a loader conventionally loads the wafer onto a stage of 
the pattern recognition system and the user instructs the stage to position the wafer layer so the 
digital camera can capture the pattern at de-skew site 502-i. 
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[0038] In action 112, the pattern at de-skew site 502-i of the wafer layer is compared with 
the learned pattern. Although a single wafer at different steps of a manufacturing process may 
be used in actions 110 and 112, multiple wafers at different processing steps of the same 
manufacturing process can also be used. FIG. 3 shows one embodiment of action 112. 
Alternatively, the patterns are found by conventional pattern recognition techniques. 

[0039] In FIG. 3, action 302, the pattern is received. In one embodiment, the pattern 
recognition system receives the pattern from the digital camera. 

[0040] In action 304, the pattern is compared with each learned pattern in the recipe. The 
pattern is given a score of similarity with each learned pattern indicating how closely the two 
patterns match. In one embodiment, the pattern recognition system uses the software MTI 
Precision to determine the score of similarity (e.g., 0 to 100). MTI Precision may also output an 
offset from the original location of the learned pattern (e.g., from (XI, Yl)). MTI Precision 
software receives grayscale pixels of the pattern and outputs a score and an offset for each 
learned pattern. Alternatively, the score and the offset may be given according to a conventional 
pattern recognition method. 

[0041] In action 306, the highest score generated from the learned patterns is selected. In 
one embodiment, the pattern recognition system selects the highest score for the pattern. In the 
initial iteration of method 100, there is only one learned pattern and therefore no selection is 
necessary. 

[0042] In action 308, a determination is made as to whether or not the highest score is greater 
than a similarity threshold value. In one embodiment, the pattern recognition system makes the 
determination and the user sets the similarity threshold value. In one embodiment, the similarity 
threshold value is saved in the recipe. The similarity threshold value ranges, e.g., from 65 to 80. 
If the highest score is not greater than the similarity threshold value, then action 308 is followed 
by action 310. If the highest score is greater than the similarity threshold value, then action 308 
is followed by action 312. 

[0043] In action 310, the pattern is recognizable because it has been successfiiUy matched 
with a learned pattern. In action 312, the pattern is not recognizable because it has not been 
matched with learned patterns. In one embodiment, the pattern recognition system saves the 
results of actions 310 and 312 (the result of the determination in action 308) in memory. 
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[0044] Returning to action 1 1 4 of FIG. 1 , a determination is made whether or not all the 
wafer layers in group B have been compared with the learned patterns. If yes, then action 1 14 is 
followed by action 116. If no, then action 1 14 is followed by action 110 and method 100 cycles 
imtil all wafer layers in group B have been compared with the learned patterns. In one 
embodiment, the user makes this determination. 

[0045] In action 116, the wafer layers that have a recognizable de-skew site 502-'i are placed 
in group A, and the wafer layers that have an unrecognizable de-skew site 502 -i are placed in 
group B. In one embodiment, the user puts the wafer layers in groups A and B. 

[0046] In action 120, a determination is made as to whether or not there are wafer layers in 
group B. If yes, then action 120 is followed by action 122. If no, then action 120 is followed by 
action 130, which ends method 100 because all wafer layers have de-skew site 502-i 
recognizable by a single recipe. In one embodiment, the user makes this determination, 

[0047] As previously discussed, method 100 can be repeated for additional de-skew sites 
(e.g., de-skew site 502-n) so the single recipe can be used for multiple de-skew sites. In one 
embodiment, the pattern recognition system performs method 100 for multiple de-skew sites in 
parallel. If the same patterns are formed at different de-skew sites of the same wafer layer, then 
new patterns do not need to be learned. However, the pattern recognition system would still 
need to find and save the locations of these additional de-skew sites by, e.g., using the learned 
patterns. 

[0048] In action 122, a pattern of de-skew site 502-i on a wafer layer in group B (e.g., wafer 
layer 2 on FIG. 6B) is selected. In one embodiment, the selected pattern of de-skew sites 502-i 
on this wafer layer is the most similar to patterns of de-skew site 502-i on the other wafer layers 
in group B. In one embodiment, the pattern is selected for like reasons in action 102. In one 
embodiment, the user selects the pattern. 

[0049] In action 124, the wafer layer is loaded and moved to examine de-skew site 502-i. In 
one embodiment, action 124 is similar to action 1 10. 

[0050] In an optional action 126, the offset of de-skew site 502-i is calculated using the 
learned patterns in the recipe regardless of the similarity threshold value, and de-skew site 502-i 
is aligned according to the offset. In one embodiment, the pattern recognition system uses the 
software MTI Precision to output an offset from the last learned pattern and the user determines 
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if the offset is reasonable. If the offset is reasonable, the user instructs the pattern recognition 
system to align de-skew site 502-i according to the offset (e.g., moving the wafer layer so the 
digital camera is directly above the center of de-skew site 502-i to capture the pattern). If the 
offset is not reasonable, the user visually aligns de-skew site 502-i (e.g., in action 128). MTI 
Precision software receives grayscale pixels of the pattern and outputs the offset. Alternatively, 
the offset may be determined by a conventional pattern recognition method. 

[0051] In action 128, the alignment of de-skew site 502-i is visually adjusted. In one 
embodiment, the user makes fine visual adjustment by instructing the stage to position the wafer 
layer so the digital camera is directly above the center of de- skew site 502-i to capture the 
pattern. Action 128 is followed by the previously described action 103 where the pattern is 
learned. 

[0052] In one embodiment of action 130, the recipe may hold only one pattern that can be 
used to recognize the same de-skew site on multiple wafer layers. In another embodiment of 
action 130, the recipe may hold multiple patterns where each pattern can be used to recognize the 
same de-skew site on at least one wafer layer. In yet another embodiment of action 130, the 
recipe may hold multiple patterns where each pattern can be used to recognize the same de-skew 
site on multiple wafer layers. 

[0053] FIG. 4 shows a method 400 for using the single recipe generated in method 100. 
Similar to method 100, a pattern recognition system, or a user, or a combination of the pattern 
recognition system and the user can perform embodiments of method 400. The pattern 
recognition system can be standalone or part of a metrology measurement system used to 
characterize wafers before, during, and after wafer processing. 

[0054] In action 402, the recipe generated in method 100 is loaded. In one embodiment, the 
pattern recognition system loads the recipe into memory (e.g., retrieves from a database on a 
hard disk into RAM). 

[0055] In actions 404 and 406, a wafer at any processing step (e.g., any wafer layer) is 
loaded and positioned so a de-skew site (e.g., de-skew site 502-i') can be examined. In one 
embodiment, the wafer layer is positioned to the expected location (e.g., saved location) of de- 
skew site 502-i. Actions 404 and 406 are like the previously described action 1 10. 

[0056] In action 408^ a search for de-skew site 502-i' of the wafer layer is performed by 

-9- 



M-n928 US 
775203 



comparing patterns on the wafer layer with the learned patterns in the recipe, hi one 
embodiment, the pattern recognition system uses MTI Precision to find de-skew site 502 -i' on 
the wafer layer. Alternatively, de-skew site 502-i' may be searched according to a conventional 
pattern recognition method. 

[0057] hi one embodiment of method 400, wafer 500 may be loaded so the current locations 
(e.g., (Xr,Yr) and (X2',Y2') in FIG. 5B) and the saved locations (e.g., (XI, Yl) and (X2,Y2) in 
FIG. 5B) of de-skew sites 502-i and 502 -n are very distant. Thus in action 408 when the digital 
camera is placed over the saved locations, the captured pattern may not completely include the 
pattern of de-skew sites 502-i or 502-n. Thus, the pattern recognition system needs to search in a 
broader area for the patterns of de-skew sites 502-i and 502-n. 

[0058] In one embodiment illustrated in FIG. 5B, the pattern recognition system starts the 
search at the saved location of one of the de-skew sites (e.g., the saved location of de-skew site 
502-n) and captures patterns in an outwardly spiraling route 506. MTI Precision determines 
from each pattern whether there is a match with the learned patterns in the recipe (e.g., like 
action 112). 

[0059J In a second pass through action 408 (once de-skew site 502-i' has been found), the 
pattern recognition system searches for de-skew site 502-n' on a circular perimeter 508 having a 
center at the location of de-skew site 502-i' and a radius R equal to the distance between de-skew 
sites 502-i and 502-n. In one embodiment, the pattern recognition system sweeps in incremental 
steps in alternating directions. The first search point on parameter 508 may be in the angle 
which the saved locations of de-skew sites 502-i and 502-n were oriented. 

[0060] In action 41 0, a determination is made as to whether or not de-skew site 502-i' has 
been recognized. If no, action 410 is followed by action 412. If yes, action 410 is followed by 
action 414, 

[0061] In action 412, the wafer is rejected because it cannot be de-skewed. In one 
embodiment, the pattern recognition system outputs an alarm that the wafer cannot be de-skewed 
to the user. 

[0062] In action 414, a determination is made whether or not the last de-skew site has been 
recognized. If yes, then action 414 is followed by action 416. If no, action 414 is followed by 
the previously described action 406 where another de-skew site is examined. In one 
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embodiment, the pattern recognition system makes this determination (e.g., if the second de- 
skew site has been recognized). 

f0063] In action 416, a transformation matrix is determined. In one embodiment, the pattern 
recognition system uses a conventional Gauss elimination algorithm to calculate the 
transformation matrix from the saved coordinates of de-skew sites 502 -i and 502-n (e.g., 
(X1,Y1) and (X2, Y2) in FIG. 5B) in the recipe and the just-determined coordinates of de-skew 
sites 502-i' and 502-n' (e.g., (Xr,Yr) and (X2%Y2') in FIG. 5B). 

[0064] In action 41 8, a measurement is taken at predetermined measurement site SOT on 
wafer 500 using the transformation matrix. In one embodiment, the pattern recognition system 
calculates new coordinates of measurement site 501' (XO',YO') from the known coordinates of 
measurement site 501 (XO,YO) using the transformation matrix. A metrology tool takes the 
measurement at the new coordinates of (X0',Y0') of measurement site 501'. 

[0065] Table 1 illustrates an embodiment of the data structure of the recipe generated in 
method 100. 

10066] Table 1 



iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiitiititiiiiiiiiiiiiiiiiiiiiiitiiiiiiiiiiiiiiin 

II Deskew recipe data structure 

lllllllllllllllllllllllllllllllllllllllllllllllllllllllllllllllllllin 
define MAX_NUM_FILES = 10 

struct DeskewRecipe 
{ 

int m_nScoreThreshold; //second score threshold, user specified 

int m_nNumPRFiles; //number of learnt pattern files for each deskew site. Must 

MAX__NUM_DESKEW_PRFILES 
C String m_strDeskewName[2]; //deskew names that user specified for two deskew sites 
double m_dX[2]; I Ik coordinates in mm for two deskew sites 
double m_dY[2]; fly coordinates in mm for two deskew sites 

CString m_strPRFileName[2][MAX_NUM_DESKEW_PRFILES]; 

//File names for MTI precision saving learned pattem(s) information 
includes mask, if any, for two deskew sites. The filenames are 
generated by software based on m_strDeskewName[i]. 

//for deskew find only 

double m_dCurrX[2]; //current x coordinates in mm for two deskew sites 

double m_dCurrY[2]; //current y coordinates in mm for two deskew sites 

}; 

[0067] Although the invention has been described with reference to particular embodiments. 
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the description is a representative example and should not be taken as a limitation. For example, 
the actions 1 14 to 120 can be automated (e.g., performed by the pattern recognition system 
instead of the user), hi addition, more than two de-skew sites can be used to de-skew wafer 500. 
Various other adaptations and combinations of features of the embodiments disclosed are within 
the scope of the invention as defined by the following claims. 
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